Phone-level pronunciation scoring and assessment for interactive language learning

Authors

  • Silke M. Witt
  • Steve J. Young
Abstract

This paper investigates a method of automatic pronunciation scoring for use in computer-assisted language learning (CALL) systems. The method utilises a likelihood-based 'Goodness of Pronunciation' (GOP) measure which is extended to include individual thresholds for each phone based on both averaged native confidence scores and on rejection statistics provided by human judges. Further improvements are obtained by incorporating models of the subject's native language and by augmenting the recognition networks to include expected pronunciation errors. The various GOP measures are assessed using a specially recorded database of non-native speakers which has been annotated to mark phone-level pronunciation errors. Since pronunciation assessment is highly subjective, a set of four performance measures has been designed, each of them measuring different aspects of how well computer-derived phone-level scores agree with human scores. These performance measures are used to cross-validate the reference annotations and to assess the basic GOP algorithm and its refinements. The experimental results suggest that a likelihood-based pronunciation scoring metric can achieve usable performance, especially after applying the various enhancements. © 2000 Elsevier Science B.V. All rights reserved.
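The abstract does not spell out the scoring formula, so the following is only a minimal sketch of how a likelihood-based GOP score with per-phone thresholds might be computed. It assumes the commonly cited form of the measure: the absolute difference between the log-likelihood of the expected phone (from a forced alignment) and the log-likelihood of the best unconstrained phone path over the same frames, normalised by the segment length. The names `PhoneSegment`, `gop_score`, `flag_mispronunciations`, and the threshold values are hypothetical and chosen for illustration; the log-likelihoods are assumed to come from an external HMM recogniser.

```python
from typing import Dict, List, NamedTuple, Tuple


class PhoneSegment(NamedTuple):
    """One aligned phone segment (hypothetical container for illustration)."""
    phone: str            # expected phone label from the forced alignment
    forced_loglik: float  # log-likelihood of the frames given the expected phone
    best_loglik: float    # log-likelihood of the best unconstrained phone path
    num_frames: int       # number of acoustic frames in the segment


def gop_score(seg: PhoneSegment) -> float:
    """Duration-normalised absolute log-likelihood ratio (assumed GOP form)."""
    if seg.num_frames <= 0:
        raise ValueError("segment must contain at least one frame")
    return abs(seg.forced_loglik - seg.best_loglik) / seg.num_frames


def flag_mispronunciations(
    segments: List[PhoneSegment],
    thresholds: Dict[str, float],
    default_threshold: float = 1.0,  # assumed fallback, not from the paper
) -> List[Tuple[str, bool]]:
    """Mark a phone as rejected when its score exceeds its own threshold."""
    return [
        (seg.phone, gop_score(seg) > thresholds.get(seg.phone, default_threshold))
        for seg in segments
    ]


if __name__ == "__main__":
    # Made-up numbers purely to exercise the functions.
    segments = [
        PhoneSegment("ae", -120.0, -100.0, 10),
        PhoneSegment("t", -50.0, -48.0, 4),
    ]
    thresholds = {"ae": 1.5, "t": 0.8}
    for phone, rejected in flag_mispronunciations(segments, thresholds):
        print(phone, "rejected" if rejected else "accepted")
```

In this sketch a larger score means the expected phone fits the audio worse than the recogniser's free choice, so a phone is rejected when the score exceeds its threshold. How the per-phone thresholds are derived from native confidence scores and human rejection statistics is described in the paper itself and is not reproduced here.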


Similar resources

The SRI EduSpeak System: Recognition and Pronunciation Scoring for Language Learning

The EduSpeak system is a software development toolkit that enables developers of interactive language education software to use state-of-the-art speech recognition and pronunciation scoring technology. We first report results on the application of adaptation techniques to recognize both native and nonnative speech in a speaker-independent manner. We discuss our pronunciation scoring paradigm an...


Performance Measures for Phone-Level Pronunciation Teaching in CALL

This work presents a general development framework for automatic pronunciation assessment within computer-assisted language learning (CALL) together with several refinements of a previously described pronunciation scoring method. This method utilises a likelihood-based 'Goodness of Pronunciation' (GOP) measure which in this work has been extended to include individual thresholds for each phone ba...


Pronunciation Barriers and Computer Assisted Language Learning (CALL): Coping the Demands of 21st Century in Second Language Learning Classroom in Pakistan

Pronunciation of English language is a very important sub-skill of speaking module in second language learning process. However, it is ignored, neglected, and even never gotten least attention by the teachers, administrators, and stakeholders especially in Pakistan. Grammar, vocabulary, and the other linguistic skills such as reading and writing are emphasized whereas pronunciation has never be...


Automatic assessment of children speech to support language learning

The focus of this work is on pattern-recognition-related aspects of computer-assisted pronunciation training (CAPT) for second language learning. An overview of commercial systems shows that pronunciation training is being addressed by the growing field of computer-assisted language learning only to a small extent, although in the state-of-the-art section a number of such approaches for automatic asses...


Automatic pronunciation scoring of specific phone segments for language instruction

The aim of the work described in this paper is to develop methods for automatically assessing the pronunciation quality of specific phone segments uttered by students learning a foreign language. From the phonetic time alignments generated by SRI's Decipher™ HMM-based speech recognition system, we use various probabilistic models to produce pronunciation scores for the phone utterance. We evalu...



Journal:
  • Speech Communication

Volume 30

Publication date: 2000